Can a syntactic language model be incorporated into an HMM-based ASR framework?
نویسنده
چکیده
Automatic speech recognition (ASR) is the process which a computer interprets human speech into some kind of meaningful representation. Usually, the task for the automatic speech recognition is to identify the correct wordsequence. Speech recognizer is used in different applications for example simple command-control programs, transcription and speech understanding. A good automatic speech recognizer should be able to recognize spontaneous continuous speech and should not require the speaker to break up their speech into discrete words. The task of recognizing speech from newspapers or news broadcast with a state of the art speech recognition systems can be done with accuracy higher than 90 percent. The accuracy drops significant when using spontaneous speech, due to the fact that the acoustic and language models usually have been built using written language or speech from written language [9]. This term paper is a literature review in the area of automatic speech recognizer and we will investigate different techniques to incorporate syntactic information into the speech recognizer. How can a speech recognizer benefit from syntactic information? Is it possible to integrate a parser? The paper is organized as follows. We begin with a brief overview of automatic speech recognition. Section 3 investigates different methods of integrating language models. Finally, the paper ends with conclusions.
منابع مشابه
Improving grapheme-based ASR by probabilistic lexical modeling approach
There is growing interest in using graphemes as subword units, especially in the context of the rapid development of hidden Markov model (HMM) based automatic speech recognition (ASR) system, as it eliminates the need to build a phoneme pronunciation lexicon. However, directly modeling the relationship between acoustic feature observations and grapheme states may not be always trivial. It usual...
متن کاملHybrid System of Optimal Self Organizing Maps and Hidden Markov Model for Arabic Digits Recognition
Thanks to Automatic Speech Recognition (ASR), a lot of machines can nowadays emulate human being ability to understand and speak natural language. However, ASR problematic could be as interesting as it is difficult. Its difficulty is precisely due to the complexity of speech processing, which takes into consideration many aspects: acoustic, phonetic, syntactic, etc. Thus, the most commonly used...
متن کاملOn recognition of non-native speech using probabilistic lexical model
Despite various advances in automatic speech recognition (ASR) technology, recognition of speech uttered by non-native speakers is still a challenging problem. In this paper, we investigate the role of different factors such as type of lexical model and choice of acoustic units in recognition of speech uttered by non-native speakers. More precisely, we investigate the influence of the probabili...
متن کاملMultilingual speech recognition A posterior based approach
Modern automatic speech recognition (ASR) systems are based on parametric statistical models such as hidden Markov models (HMMs), exploiting 1) acoustic-phonetic models, which need to be trained on large amount of acoustic data, 2) a language model, which needs to be trained on large amount of text data and, finally, 3) a lexicon with phonetic transcription which requires linguistic expertise. ...
متن کاملNoise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory
This article proposes and evaluates various methods to integrate the concept of bidirectional Long Short-Term Memory (BLSTM) temporal context modeling into a system for automatic speech recognition (ASR) in noisy and reverberated environments. Building on recent advances in Long Short-Term Memory architectures for ASR, we design a novel front-end for contextsensitive Tandem feature extraction a...
متن کامل